Adaptive compression

ABSTRACT

A method, system, and apparatus are directed towards compression of content. A portion of content may be compressed using a compression mode. One or more criteria may be evaluated. Based on the evaluated criteria, a decision is made as to whether to select a different compression mode. If selected, the different compression mode may be used to compress another portion of the content. Additional compression modes may be selected and used to compress the content.

CROSS-REFERENCE TO RELATED APPLICATIONS

The present application is a Continuation of U.S. application Ser. No. 11/376,913 filed Mar. 16, 2006, which claims priority from provisional application Ser. No. 60/748,052, entitled “Adaptive Compression,” filed on Dec. 7, 2005; the benefit of the earlier filing date of which is hereby claimed under 35 U.S.C. §119 (e) and 37 C.F.R. §1.78, and which is further incorporated by reference.

FIELD OF THE INVENTION

The invention relates generally to communicating content over a network, and more particularly but not exclusively to adaptively compressing content that may be communicated over a network.

BACKGROUND OF THE INVENTION

According to some studies, the volume of information over a network, such as the Internet, is expected to more than triple over the next three years. Data and content is likely to remain the largest percentage of Internet traffic, with the majority of this information being dynamic. Often, the issues of concern with Internet traffic range from business to consumer response and order times, to the time required to deliver business information to a traveler using a wireless device, to the download time for rich media such as music, videos, and so forth. Thus, not surprisingly, a major complaint among Internet users is a lack of speed. Additionally, users' complaints often center on how long it takes to display a web page, or other content, on their computing device. One solution therefore, may be to send less data. This is where compression may help.

One way that this is performed by compressing data being sent from a server, and having a client's browser decompress this data upon receipt, thereby reducing the amount of data sent over the Internet, and increasing a web page display speed. Many, although not all, browsers are now equipped to support the Hypertext Transfer Protocol (HTTP) modes that enable compression.

A number of compression algorithms are available for compressing data. The Lempel-Ziv (LZ) compression methods are a set of well-known compression algorithms. DEFLATE is a variation on LZ, and is used in the PKZIP, gzip, and PNG compression algorithms. Lempel-Ziv-Welch (LZW) and LZ-Renau (LZR) are also known compression algorithms. There exist other compression algorithms as well. Some compression algorithms can be performed with one or more parameters that affect one or more of the speed at which the algorithm executes, the amount of memory used during compression, or the data compression ratio. As used herein, the term “compression mode” refers to a combination of a compression algorithm and parameters or factors that distinguish the resulting compressed data. For example, gzip can be invoked with an optional parameter that indicates a relative speed that the algorithm is to be performed. Thus, “gzip-1 generally executes at a higher speed than “gzip-9.” Typically, though not always, the slower mode of gzip results in a higher compression ratio. Each of the allowable speed designations is therefore considered to be a different compression algorithm as used herein.

Some devices are configured to perform a specific compression mode. However, computer systems are complex, and this does not always produce the optimal results. Thus, it is with respect to these considerations and others that the present invention has been made.

BRIEF DESCRIPTION OF THE DRAWINGS

Non-limiting and non-exhaustive embodiments of the invention are described with reference to the following drawings. In the drawings, like reference numerals refer to like parts throughout the various figures unless otherwise specified.

For a better understanding of the invention, reference will be made to the following Detailed Description of the Invention, which is to be read in association with the accompanying drawings, wherein:

FIGS. 1 a and 1 b show functional block diagrams illustrating environments for practicing the invention;

FIG. 2 shows one embodiment of a network device that may be included in a system implementing the invention;

FIG. 3 illustrates a logical flow diagram generally showing one embodiment of a process for accelerating compression of content on the fly using a buffer;

FIG. 4 illustrates a logical flow diagram generally showing one embodiment of a process for compressing the content into the buffer, in accordance with the invention; and

FIG. 5 illustrates a logical flow diagram generally showing one embodiment of a process for compressing content, in accordance with the invention.

DETAILED DESCRIPTION OF THE INVENTION

The invention now will be described more fully hereinafter with reference to the accompanying drawings, which form a part hereof, and which show, by way of illustration, specific exemplary embodiments by which the invention may be practiced. This invention may, however, be embodied in many different forms and should not be construed as limited to the embodiments set forth herein; rather, these embodiments are provided so that this disclosure will be thorough and complete, and will fully convey the scope of the invention to those skilled in the art. Among other things, the invention may be embodied as methods or devices. Accordingly, the invention may take the form of an entirely hardware embodiment, an entirely software embodiment or an embodiment combining software and hardware aspects. The following detailed description is, therefore, not to be taken in a limiting sense.

Throughout the specification and claims, the following terms take the meanings explicitly associated herein, unless the context clearly dictates otherwise. The phrase “in one embodiment” as used herein does not necessarily refer to the same embodiment, though it may. As used herein, the term “or” is an inclusive “or” operator, and is equivalent to the term “and/or,” unless the context clearly dictates otherwise. The term “based on” is not exclusive and allows for being based on additional factors not described, unless the context clearly dictates otherwise. In addition, throughout the specification, the meaning of “a,” “an,” and “the” include plural references. The meaning of “in” includes “in” and “on.”

Briefly stated, the invention is directed towards a system, apparatus, and method for managing compression of data. One aspect of the invention includes, after compressing a portion of the data, determining whether one or more criteria have been satisfied. If one or more of the criteria has been satisfied, a different compression mode may be selected and used for compressing an additional portion of the data. This process may be repeated, resulting in two or more compression modes being used to compress the data. One of the compression modes used may be an identity compression mode. Identity compression mode arises where the content is encoded as uncompressed blocks, but may maintain compression mode type of headers, and/or similar packet wrappers. Identity compression mode may therefore represent a zero compression. For example, using the zlib library, identity compression may be ‘forced.’ As used herein, the term “compression” refers to the processing of data into a compression format, even if the compressed size is not reduced from the original size. If the identity compression mode is used, a new content length may be determined based in part on the length of the compressed content and the remaining content to be encoded. The buffered content and the new content length may then be forwarded in response to the request.

The criteria that are evaluated may related to the system load or to observed results of compression already performed. System load may include metrics such as a percentage of CPU processing power in use, a percentage of memory in use, or amounts of other resources used or available for use.

In one embodiment, a request for content may be provided to a server over a network, using a protocol such as HTTP. The content may include content length information, such as within a protocol header. In one embodiment, a portion of the content may be compressed to approximately fill a buffer using a selected compression mode. In one embodiment, the selected compression mode employs maximum compression on the portion of the content. Maximum compression may be defined as a highest level of compression selectable for a given compression mechanism, and/or a content type. Thus, for example, employing gzip, a maximum level of compression (for example, gzip-9) may result in using an iterative dynamic programming algorithm to compress the portion of the content.

Illustrative Operating Environment

FIG. 1 a illustrates an environment in which the invention may operate. However, not all of these components may be required to practice the invention, and variations in the arrangement and type of the components may be made without departing from the spirit or scope of the invention.

As shown in the figure, system 100 includes client device 102, network 105, traffic management device (TMD) 106, and servers 108-110. Client device 102 is in communication with TMD 106 through network 105. TMD 106 is in further communication with servers 108-110. Although not shown, TMD 106 may be in communication with servers 108-110 through a network infrastructure that is similar to network 105. For example, in one embodiment, TMD 106, and servers 108-110 might reside within a common local area network type of infrastructure, although the invention is not constrained to such a configuration.

Generally, client device 102 may include virtually any computing device capable of connecting to another computing device and receiving information. Such devices may also include portable devices such as, cellular telephones, smart phones, display pagers, radio frequency (RF) devices, infrared (IR) devices, Personal Digital. Assistants (PDAs), handheld computers, wearable computers, tablet computers, integrated devices combining one or more of the preceding devices, and the like. Client device 102 may also include other computing devices, such as personal computers, multiprocessor systems, microprocessor-based or programmable consumer electronics, network devices, and the like. As such, client device 102 may range widely in terms of capabilities and features. For example, a client device configured as a cell phone may have a numeric keypad and a few lines of monochrome LCD display on which only text may be displayed.

In another example, a web-enabled client device may have a touch sensitive screen, a stylus, and several lines of color LCD display in which both text and graphics may be displayed. Moreover, the web-enabled client device may include a browser application enabled to receive and to send wireless application protocol messages (WAP), and/or wired application messages, and the like. In one embodiment, the browser application is enabled to employ HyperText Markup Language (HTML), Dynamic HTML, Handheld Device Markup Language (HDML), Wireless Markup Language (WML), WMLScript, JavaScript, EXtensible HTML (xHTML), Compact HTML (CHTML), and the like, to display and send a message. Moreover, the browser application may also be enabled to receive compressed files using any of a variety of compression mechanisms, including, but not limited to gzip as described in Request for Comments (RFC) 1952 (available at http://www.ietf.org/rfc/rfc1952.txt?number=1952), deflate as described in RFC 1951 (available at http://www.ietf.org/rfc/rfc1951.txt?number=1951), zlib as described in RFC 1950 (available at http://www.ietforg/rfc/rfc1950.txt?number=1950), each of which is incorporated by reference, as well as other mechanisms, such as glib-deflate, or the like.

Client device 102 also may include at least one client application that is configured to receive content from another computing device. The client application may include a capability to provide and receive textual content, graphical content, audio content, alerts, messages, and the like. Moreover, client device 102 may be further configured to communicate a message, such as through a Short Message Service (SMS), Multimedia Message Service (MMS), instant messaging (IM), internet relay chat (IRC), mIRC, Jabber, and the like, between another computing device, and the like.

In one embodiment, client device 102 may be configured such that an end-user may operate the computing device to make requests for data and/or services from other computers on the network. In one embodiment, client device 102 may employ a network interface unit (sometimes called a transceiver), such as described below, to communicate information with another computing device. Often, the requested data resides in computing devices such as servers 108-110. In this specification, the term “client” refers to a computer's general role as a requester of data or services, and the term “server” refers to a computer's role as a provider of data or services. In general, it is possible that a computer can act as a client, requesting data or services in one transaction and act as a server, providing data or services in another transaction, thus changing its role from client to server or vice versa. In one embodiment, client device 102 is a computing device that is not operated by an end-user.

Network 105 is configured to couple one computing device with another computing device. Network 105 may be enabled to employ any form of computer readable media for communicating information from one electronic device to another. Also, network 105 can include the Internet in addition to local area networks (LANs), wide area networks (WANs), direct connections, such as through a universal serial bus (USB) port, other forms of computer-readable media, or any combination thereof. On an interconnected set of LANs, including those based on differing architectures and protocols, a router acts as a link between LANs, enabling messages to be sent from one to another. Also, communication links within LANs typically include twisted wire pair or coaxial cable, while communication links between networks may utilize analog telephone lines, full or fractional dedicated digital lines including T1, T2, T3, and T4, Integrated Services Digital Networks (ISDNs), Digital Subscriber Lines (DSLs), wireless links including satellite links, or other communications links known to those skilled in the art. Furthermore, remote computers and other related electronic devices could be remotely connected to either LANs or WANs via a modem and temporary telephone link.

Network 105 may further include any of a variety of wireless sub-networks that may further overlay stand-alone ad-hoc networks, and the like, to provide an infrastructure-oriented connection. Such sub-networks may include mesh networks, Wireless LAN (WLAN) networks, cellular networks, and the like. Network 105 may also include an autonomous system of terminals, gateways, routers, and the like connected by wireless radio links, and the like. These connectors may be configured to move freely and randomly and organize themselves arbitrarily, such that the topology of network 105 may change rapidly.

Network 105 may further employ a plurality of access technologies including 2nd (2G), 2.5, 3rd (3G), 4th (4G) generation radio access for cellular systems, WLAN, Wireless Router (WR) mesh, and the like. Access technologies such as 2G, 3G, and future access networks may enable wide area coverage for mobile devices with various degrees of mobility. For example, network 105 may enable a radio connection through a radio network access such as Global System for Mobile communication (GSM), General Packet Radio Services (GPRS), Enhanced Data GSM Environment (EDGE), Wideband Code Division Multiple Access (WCDMA), CDMA2000, and the like. In essence, network 105 may include virtually any wired and/or wireless communication mechanisms by which information may travel between one computing device and another computing device, network, and the like.

Additionally, communication media typically embodies computer-readable instructions, data structures, program modules, or other data in a modulated data signal such as a carrier wave, data signal, or other transport mechanism and includes any information delivery media. The terms “modulated data signal,” and “carrier-wave signal” includes a signal that has one or more of its characteristics set or changed in such a manner as to encode information, instructions, data, and the like, in the signal. By way of example, communication media includes wired media such as twisted pair, coaxial cable, fiber optics, wave guides, and other wired media and wireless media such as acoustic, RF, infrared, and other wireless media.

TMD 106 may include virtually any device that manages network traffic. Such devices include, for example, routers, proxies, firewalls, load balancers, cache devices, devices that perform network address translation, any combination of the preceding devices, and the like. TMD 106 may, for example, control the flow of data packets delivered to and forwarded from an array of servers, such as servers 108-110. TMD 106 may direct a request for a resource to a particular server based on network traffic, network topology, capacity of a server, content requested, and a host of other traffic distribution mechanisms. TMD 106 may receive data packets from and transmit data packets to the Internet, an intranet, or a local area network accessible through another network. TMD 106 may recognize packets that are part of the same communication, flow, and/or stream and may perform special processing on such packets, such as directing them to the same server so that state information is maintained. TMD 106 also may support a wide variety of network applications such as Web browsing, email, telephony, streaming multimedia and other traffic that is sent in packets.

TMD 106 may receive requests from client device 102. TMD 106 may select a server from servers 108-110 to forward the request. TMD 106 may employ any of a variety of criteria and mechanisms to select the server, including those mentioned above, load balancing mechanisms, and the like. TMD 106 is further configured to receive a response to the request and to forward the response to client device 102.

TMD 106 may also perform adaptive compression of at least a portion of the response to the request as it is being forwarded to client device 102. TMD 106 may select a compression mode to initially compress at least a portion of the response, and based on the results of the compression of the portion, employ an identity compression mode to compress the remaining portion of the response. TMD 106 may also employ processes substantially similar to those described below in conjunction with FIGS. 3-4 to perform at least some of its actions.

TMD 106 may be implemented using one or more personal computers, servers, multiprocessor systems, microprocessor-based or programmable consumer electronics, network PCs, radio frequency (RF) devices, infrared (IR) devices, integrated devices combining one or more of the preceding devices, and the like. Such devices may be implemented solely in hardware or in hardware and software. For example, such devices may include some application specific integrated circuits (ASICs) coupled to one or more microprocessors. The ASICs may be used to provide a high-speed switch fabric while the microprocessors may perform higher layer processing of packets. An embodiment of a network device that could be used as TMD 106 is network device 200 of FIG. 2, configured with appropriate software. The BIG-IP® family of traffic managers, by F5 Networks of Seattle, Wash., are examples of TMDs.

Servers 108-110 may include any computing device capable of communicating packets with client computer 102 to provide a response to a request for data and/or services. Each packet may convey a piece of information. A packet may be sent for handshaking, i.e., to establish a connection or to acknowledge receipt of data. The packet may include information such as a request, a response, or the like. Generally, packets received by servers 108-110 will be formatted according to TCP/IP, but they could also be formatted using another transport protocol, such as User Datagram Protocol (UDP), Internet Control Message Protocol (ICMP), NETbeui, IPX/SPX, token ring, and the like. Moreover, the packets may be communicated between servers 108-110, TMD 106, and client device 102 employing HTTP, HTTPS, and the like.

In one embodiment, servers 108-110 are configured to operate as a website server. However, servers 108-110 are not limited to web servers, and may also operate a messaging server, a File Transfer Protocol (FTP) server, a database server, content server, and the like. Additionally, each of servers 108-110 may be configured to perform a different operation. Thus, for example, server 108 may be configured as a messaging server, while server 109 is configured as a database server. Moreover, while servers 108-110 may operate as other than a website, they may still be enabled to receive an HTTP communication.

Devices that may operate as servers 108-110 include personal computers desktop computers, multiprocessor systems, microprocessor-based or programmable consumer electronics, network PCs, servers, and the like.

Although the above configuration illustrates compression acceleration being performed within TMD 106, the invention is not so limited. Thus, in one embodiment, the compression acceleration may also be performed with at least one of servers 108-110, and/or a server and TMD 106, without departing from the scope or spirit of the invention.

FIG. 1 b illustrates another environment in which the invention may operate. The environment of FIG. 1 b has similarities to FIG. 1 a, and like-numbered objects are similar to those described with respect to FIG. 1 a. In FIG. 1 b, an additional. TMD 112 is located on the client side of the network and receives data sent to and from the client device 102. In one embodiment, TMD 112 performs decompression of data sent to the client device 102, and sends uncompressed data to the client device 102. The WANJet appliance, by F5 Networks, Inc., is one example of a device used in an environment with a TMD on both the client side and the server side of a network.

In one embodiment of the invention, TMD 112 of FIG. 1 b compresses data being sent from client 102 to one of the servers 108-110. TMD 112 may therefore perform the methods of the invention described herein. In one embodiment, both TMD 106 and TMD 112 perform the methods of the invention, and compression is performed on data being sent in both directions of a communication between client 102 and servers 108-110. Further, the terms client and server as used herein may refer to functional roles of the devices, and a server may function as a client, or a client may function as a server, with respect to performing aspects of the invention. Thus, the invention may be performed in a peer-to-peer environment between client devices or in a web services environment between servers.

Illustrative TMD Environment

FIG. 2 shows an exemplary network device 200 that may operate as TMD 106 or TMD 112 of FIGS. 1 a-b. It will be appreciated that not all components of network device 200 are illustrated, and that network device 200 may include more or less components than those shown in FIG. 2.

As illustrated in FIG. 2, network device 200 includes a central processing unit (CPU) 202, mass memory, and a network interface unit 212 connected via a bus 204. Network interface unit 212 includes the necessary circuitry for connecting network device 200 to various networks, including network 105 of FIGS. 1 a-b, and is constructed for use with various communication protocols including the TCP/IP and UDP/IP protocol. Network interface unit 212 may include or interface with circuitry and components for transmitting messages and data over a wired and/or wireless communications medium. Network interface unit 212 is sometimes referred to as a transceiver, Network Interface Card (NIC), or the like.

The mass memory generally includes random access memory (“RAM”) 206, read-only memory (“ROM”) 214, and one or more permanent mass storage devices, such as hard disk drive 208. The mass memory stores operating system 216 for controlling the operation of network device 200. The operating system 216 may comprise an operating system such as UNIX, LINUX™, or Windows™. In one embodiment, the mass memory may store program code and data for implementing Adaptive Compression Manager (ACM) 220, and buffer 222.

Buffer 222 includes virtually any mechanism that is configured to receive and store content, including compressed content. In one embodiment, buffer 222 is illustrated within RAM 206; however, the invention is not so limited. For example, buffer 222 may also reside within disk drive 208, or the like, without departing from the scope of the invention. In one embodiment, buffer 222's size is about 16K bytes, however, the invention is not so constrained to this size, and buffer 222 may have virtually any size based, in part, on any of a variety of criteria. For example, buffer 222's size may be selected based on information about a typical size of files sent from a server.

ACM 222 may be configured to manage adaptive compression of content over a network. ACM 222 may receive a file from another device. Such files may include any of a variety of data and data formats, including documents, applications, movies, videos, music, audios, still images, text, graphics, and so forth.

ACM 222 may select to compress at least a portion of the received file. In one embodiment, ACM 222 may select a compression mode to compress at least the portion of the file into a temporary store, such as buffer 222, or the like. Then, based on whether the file can be completely compressed into buffer 222, ACM 222 may select to employ another compression mode, such as the identity compression mode, to compress a remaining portion of the file, where the remaining portion is that portion of the file that remains after the buffer is approximately full with compressed data from the first portion. Moreover, ACM 222 may adjust a value that indicates a length of the file. The value may be located in a protocol header. In one embodiment, the value is a content length value that is capable of being included in a protocol header such as the HTTP Content-Length header field. The HTTP Content-Length header field is described in RFC 2616, available at http://www.ietf.org/rfc/rfc2616.txt. ACM 222 may employ processes such as those described below in conjunction with FIGS. 3-4 to perform at least some of its actions.

In one embodiment, network device 200 includes one or more Application Specific Integrated Circuit (ASIC) chip 226 connected to bus 204. As shown in FIG. 2, network interface unit 212 may connect to 204 bus through at least one ASIC chip. ASIC chip 226 can include logic that enables at least some of the actions of network device 200. For example, in one embodiment, ASIC chip 226 can be employed to perform packet processing functions on incoming and/or outgoing packets. In one embodiment, ASIC chip 226 performs logic for network monitor 220 and/or redirect coordinator 224. In one embodiment, network device 200 includes one or more field-programmable gate arrays (FPGA) (not shown), instead of, or in addition to, ASIC chip 226. A number of actions for the network device can be performed by ASIC chip 226, an FPGA, CPU 202 with instructions stored in memory, or any combination of actions performed by the ASIC chip, FPGA, and CPU.

Computer storage media may include volatile and nonvolatile, removable and non-removable media implemented in any method or technology for storage of information, such as computer readable instructions, data structures, program modules or other data. Examples of computer storage media include RAM 206, ROM 214, EEPROM, flash memory and/or any other memory architecture, CD-ROM, digital versatile disks (DVD) and/or any other optical storage, magnetic cassettes, magnetic tape, magnetic disk storage and/or any other magnetic storage devices, and/or any other medium that can store information that can be accessed by a computing device.

Network device 200 may also include an input/output interface (not shown) for communicating with external devices or users.

Network device 200 can also be implemented as one or more “blades” where the term “blade” refers to one of multiple electronic circuit boards or cards that are installed in a hardware chassis with a backplane. An exemplary blade may include one or more processors, volatile and non-volatile memory, interfaces suitable for communicating information to and from the blade, and other components for enabling the operation of one or more applications. A blade may also include a specialized interface for the backplane and other interfaces, such as a USB port, FIREWIRE port, serial port, RF interface, IR interface, Ethernet interface, IDE controller, and the like. An application running on a blade may employ any of these interfaces to communicate information to other applications running on other blades and/or devices coupled to the blade server. Network device 200 can also be implemented as a combination of blades and additional components in the chassis. The BIG-IP Traffic Manager and the WANJet Appliance, produced by F5 Networks, Inc., are examples of network devices that may employ the present invention.

Generalized Operation

The operation of certain aspects of the invention will now be described with respect to FIGS. 3-4. FIG. 3 illustrates a logical flow diagram generally showing one embodiment of a process for accelerating compression of content using a buffer.

Process 300 of FIG. 3, begins, after a start block, at block 302 where a file is received, including information associated with the file, such as its size. Processing flows next to block 304, which is described in more detail below in conjunction with FIG. 4. Briefly, however, at block 304 a compression mode is selected and is employed to attempt to compress the file into a buffer. The result of block 304 is that either the complete file may be compressed into the buffer using the selected compression, or the buffer size is determined to have been exceeded before the complete file is compressed into the buffer. In either event, processing then flows to decision block 306.

At decision block 306 a determination is made whether the complete file is compressed into the buffer. In one embodiment, the buffer may be about 16K bytes. However, the invention is not limited to this size, and the buffer size may be set to virtually any size, without departing from the scope or spirit of the invention. If the complete file is compressed into the buffer, processing branches to block 308; otherwise, processing flows to block 310.

At block 308, an adjusted file length based on the selected compression is determined. In one embodiment, the adjusted file length may be determined as: ADJUSTED FILE LENGTH=C+T, where C represents the compressed size of the data processed, which in this instance is the entire compressed file; and T represents a size of any compression trailers that may be included. The adjusted file length may then be employed to modify length information in a header, such as HTTP content-length information in a protocol header, or the like, that is associated with the compressed file.

At block 308, the compressed file, including the adjusted file length is then sent to a destination. Process 300 of FIG. 3 then returns to a calling process to perform other actions.

At block 310, however, it is determined that the file is not completely compressed. This may arise, for example, where the buffer size is exceeded before the entire file may be compressed into the buffer. Thus, at block 310, an expected file length of the file based on another compression mode is determined. In one embodiment, the other compression mode is the identity compression mode. In one embodiment, the expected file length may be determined using the following: EXPECTED FILE LENGTH=C+T+(Roundup(R/B)*O)+R, where C represents the size of the processed (compressed) data (which may be approximately the size, of the buffer); T represents the size of any compression trailers; and R represents the amount of data in the file remaining that is to be compressed using the identity compression. That is, R=S−(the amount of data compressed so far), where S then represents the size of the original file. B represents a block size into which another portion of the file is compressed. In one embodiment, B may be about 4096 bytes. However, virtually any block size may be employed. For example, in one embodiment, B is about the same size as the buffer. In addition, O represents a size of a block header. In one embodiment, O is about 5 bytes. The file length that is determined at block 310 is termed an expected file length at least because, at the time that the file length is determined, a remaining portion exists that has not yet been processed into compression blocks.

Processing then continues to block 312, wherein the compressed portion of the file in the buffer, including the expected file length is sent to the destination. Next, processing continues to block 314, where the remaining portion of the file that exceeded the buffer is then compressed using the second compression mode. In one embodiment, the second compression mode is the identity compression mode. In one embodiment, the uncompressed portion of the file may be compressed using fixed-size blocks of size B, as noted above. More than one fixed-size block may be employed to complete the compression of the remaining portion of the uncompressed file. In one embodiment, where there are multiple fixed-size blocks, each block is sent to the destination as it is generated, virtually on the fly. In another embodiment, the multiple fixed-size blocks are generated, and then each are sent in turn towards the destination. In any event, upon completion of block 314, process 300 returns to the calling process to perform other actions. In one embodiment, the blocks used are not of a fixed-size.

FIG. 4 illustrates a logical flow diagram generally showing one embodiment of a process for compressing the content into a buffer, in accordance with the invention. In one embodiment, process 400 of FIG. 4 may represent block 304 of FIG. 3.

Process 400 of FIG. 4, begins, after a start block, at block 402, where a file is received, including information associated with the file such as its size. Processing flows next to block 404 where a compression mode is selected. Selection of the compression mode may include employing various compression mechanisms, including gzip or the like. In one embodiment the selected compression mode may employ a maximum compression. However, the invention is not so constrained.

Processing continues to block 406, where at least a portion of the received file may be compressed into a buffer. In one embodiment, the buffer may be a fixed-size buffer of about 16K bytes. However, the invention is not limited to this size, and may be set to virtually any size, without departing from the scope or spirit of the invention. A fixed-size buffer refers to a buffer having a size that remains approximately the same at least during the process of compressing a file.

Processing then flows to decision block 408 where a determination is made whether the file is done. That is, has the file been completely compressed? If it is determined that the file is completely done, processing returns to a calling process to perform other actions. In one embodiment, the process returns to block 304 of FIG. 3.

However, if at decision block 408, it is determined that the file is incompletely compressed into the buffer, processing continues to decision block 412 where a determination is made whether there remains sufficient unused space within the buffer to attempt to compress another portion of the file into it. Although there may be some unused space within the buffer, it may not be sufficient, due to block sizing, or various other limitations, to enable another portion of the file to be compressed into the buffer. Thus, the invention also takes into account such variables. In any event, if it is determined that there is insufficient space in the buffer into which another portion of the file may be compressed, processing returns to the calling process to perform other actions. In one embodiment, the process returns to block 304 of FIG. 3. However, if it is determined that sufficient space is available into which another portion of the file may be compressed, processing loops back to block 402 to obtain another portion of the file. This process may continue until the buffer is determined to be exceeded or the file is completely compressed into the buffer.

FIG. 5 illustrates a logical flow diagram generally showing one embodiment of a process 500 for compressing content, in accordance with the invention. Process 500 of FIG. 5 begins, after a start block, at block 502, where a file is received. It is to be noted that the term “file” as used herein refers to any quantity of ordered digital data. It may include data that is streamed to the receiver. It may also include the result of processing other data, such as unencrypting encrypted data. Receiving a file may include receiving any portion of a file. Additionally, the action 502 of receiving a file does not need to be completed prior to compressing a portion of the file. In one embodiment, portions of a file are received concurrently with compressing portions of the file, and this may be done synchronously or asynchronously.

Processing flows next to block 504, where a first compression mode is selected for compression of the file contents. At block 506, the selected compression mode is employed to compress a portion of the file. Processing then flows to decision block 508.

At decision block 508, a determination is made whether the complete file has been compressed. If the complete file has been compressed, processing flows to a return block, where the process is completed. It is to be noted that this determination may be affirmative if it is desired that only a portion of a file be compressed, and some portion of the file remain uncompressed.

If, at the decision block 508, it is determined that the complete file has not been compressed, and there is additional content of the file remaining, processing flows to block 510. At block 510, one or more criteria are evaluated. The types of criteria that may be evaluated generally fall into two categories. A first category relates to results of compression that has been performed with the currently selected compression method. A second category relates to system metrics, including system load and available resources. More than one metric may be combined using a formula where each metric is weighted as desired. Further discussion of the criteria and combining criteria appears below.

Processing then flows to decision block 512, where a determination is made of whether a threshold of criteria has been met. If the threshold has been met, processing flows to block 514, where a different compression mode is selected. The compression mode selected at the block 514 is different from the compression mode in use immediately prior to this selection. It may, however, be identical to one previously used during the process 500. For example, the process 500 may employ a first compression mode, then change to a second compression mode at block 514, and during a subsequent execution of block 514 select the first compression mode.

After selection of a compression mode at block 514, processing flows to block 506, where the selected compression mode is used to compress another portion of the file. At this point, the process iteration continues.

If, at the decision block 512, it is determined that the criteria threshold has not been met, processing flows to block 506, where the process iteration continues.

As discussed above, the Criteria evaluated and used in determining whether to select a different compression mode may related to the results of prior compression, or it may related to system metrics. The results of prior compression may include, for example, a data compression ratio. A data compression ratio indicates a reduction in data quantity resulting from a compression algorithm on a specific portion of data. One way to indicate a data compression ratio is by calculating: (((original size)−(compressed size))/(original size))×100% Using this calculation, a high value indicates a high amount of compression has been performed. In one embodiment, a compression ratio below a predetermined value results in a determination to select a different compression mode.

System metrics may include one or more metrics of system load. For example, this may include a ratio of processor power that is being used as compared with total processing power. This may be measured at a point in time, or may be calculated over a period of time. Another system metric may be the ratio of memory that is being used as compared with total memory. Processor power and memory may refer to the device that is performing the compression, such as network device 200 of FIG. 2. In a system having more than one device performing compression, processor power or memory may refer to metrics relating to a combination of such devices.

Any one or more of the metrics may be combined in the calculation of box 510. The metrics may be weighted individually, and they may be normalized so that one metric does not dominate the calculation.

It will be understood that each block of the flowchart illustration, and combinations of blocks in the flowchart illustration, can be implemented by computer program instructions. These program instructions may be provided to a processor to produce a machine, such that the instructions, which execute on the processor, create means for implementing the actions specified in the flowchart block or blocks. The computer program instructions may be executed by a processor to cause a series of operational steps to be performed by the processor to produce a computer implemented process such that the instructions, which execute on the processor to provide steps for implementing the actions specified in the flowchart block or blocks.

Accordingly, blocks of the flowchart illustration support combinations of means for performing the specified actions, combinations of steps for performing the specified actions and program instruction means for performing the specified actions. It will also be understood that each block of the flowchart illustration, and combinations of blocks in the flowchart illustration, can be implemented by special purpose hardware-based systems which perform the specified actions or steps, or combinations of special purpose hardware and computer instructions. It is also understood that the actions described as part of the invention can be performed in various orders. Unless clearly stated or required, the invention is not limited to the ordering of elements in figures as illustrated.

The above specification, examples, and data provide a complete description of the manufacture and use of the composition of the invention. Since many embodiments of the invention can be made without departing from the spirit and scope of the invention, the invention resides in the claims hereinafter appended. 

What is claimed as new and desired to be protected by Letters Patent of the United States is:
 1. A system, comprising: a server device having stored thereon a file; and a first network device having processor that is programmed to perform actions comprising: receiving the file from the server device; compressing a first portion of the file using a first compression mode; evaluating a criteria comprising a normalized weighting of a combination of at least a system load metric and a data compression ratio metric, the normalization being such that no one metric dominates the criteria; selectively selecting a second compression mode and compressing a second portion of the file based on the evaluation of the criteria, wherein the second compression mode is different from the first compression mode; and selectively transmitting to one of a second network device or a client device, an expected file length that is computed based in part on a combination of the compressed portions of the file and an uncompressed portion of the file, the uncompressed portion to be subsequently compressed prior to a subsequent transmission to the second network device or the client device.
 2. The system of claim 1, wherein selectively transmitting to one of a second network device further comprises, the second network device being configured to uncompress the received portions and provide the uncompressed received portions to the client device.
 3. The system of claim 1, wherein the expected file length is transmitted within a protocol header.
 4. The system of claim 1, wherein compressing the first portion further comprises determining a size of the first compressed portion based on a size of a buffer.
 5. The system of claim 1, wherein the expected file length is further determined based on at least one of a compression trailer or a block header size.
 6. The system of claim 1, wherein the data compression ratio metric is determined using the first compression mode.
 7. The system of claim 1, wherein selectively transmitting to one of the client device further comprises, the client device being configured to uncompress the received portions of the file.
 8. A method, comprising: at the first network device: compressing a first portion of a file using a first compression mode; evaluating a criteria comprising a combination of at least a system load metric and a data compression ratio metric; selectively selecting a second compression mode and compressing a second portion of the file based on the evaluation of the criteria, wherein the second compression mode is different from the first compression mode; selectively transmitting the compressed portions and an expected file length that is computed based in part on a combination of the compressed portions of the file and an uncompressed portion of the file; compressing the uncompressed portion as at least a third portion; and transmitting the compressed third portion.
 9. The method of claim 8, further comprising: at a client device: receiving the compressed portions and the expected file length; and generating a replicate of the file by uncompressing the received compressed portions.
 10. The method of claim 8, further comprising: at a second network device having a processor: receiving the compressed portions and the expected file length; generating a replicate of the file by uncompressing the received compressed portions; and transmitting the replicated filed to a client device.
 11. The method of claim 8, wherein the expected file length is transmitted within a protocol header.
 12. The method of claim 8, the transmission of the compressed portions comprises transmitting the portions in network packets.
 13. The method of claim 8, wherein the expected file length is further determined based on at least one of a compression trailer or a block, header size.
 14. The method of claim 8, wherein the data compression ratio metric is determined based on a result of a compression of the first portion of the file.
 15. A system, comprising: a first network device having a processor that is programmed to perform actions comprising: compressing a first portion of a file using a first compression mode; evaluating a criteria comprising a normalized weighting of a combination of at least a system load metric and a data compression ratio metric, the normalization being such that no one metric dominates the criteria; selectively selecting a second compression mode and compressing a second portion of the file based on the evaluation of the criteria, wherein the second compression mode is different from the first compression mode; and selectively transmitting the compressed portions of the file and an expected file length that is based in part on a combination of the compressed portions of the file and an uncompressed portion of the file, the uncompressed portion to be subsequently compressed prior to a subsequent transmission; and a second network device having a processor that is programmed to perform actions comprising: receiving at the transmitted portions of the file and the expected file length; uncompressing the received portions of the file; and transmitting the uncompressed portions to a client device.
 16. The system of claim 15, wherein the system load metric comprises at least one of a ratio of processor power to total processing power; ratio of memory used to total available memory, or a network load.
 17. The system of claim 15, wherein the data compression ratio metric is determined based on a result of a compression of the first portion of the file.
 18. The system of claim 15, wherein the expected file length is further determined based on at least one of a compression trailer or a block header size.
 19. The system of claim 15, wherein selectively selecting a second compression mode further comprises determining the second compression mode based on whether a data compression ratio indicates a reduction in data quantity resulting from using the second compression mode on the second portion of the file.
 20. The system of claim 15, wherein in at least one compression mode selected includes an identity compression mode. 